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INFORMATION FOR SEQ ID NO: 1 

SEQUENCE CHARACTERISTICS: 

a) LENGTH: 2700 

bj TYPE: 

c) STRANDEDNESS: 

d) TOPOLOGY: 

MOLECULE TYPE: Combined DNA and Amino Acid Sequences 

HYPOTHETICAL: No 

ANTI-SENSE: No 

FRAGMENT TYPE: 

ORIGINAL SOURCE: 

IMMEDIATE SOURCE: 

POSITION IN GENOME: 

a) CHROMOSOME/SEGMENT: 



b) MAP POSITION: 

c) UNITS: 



(ix) 



FEATURE: 



a) 
b) 
c) 
d) 



NAME /KEY: 
LOCATION: 

IDENTIFICATION METHOD: 
OTHER INFORMATION: 



(x) 



PUBLICATION INFORMATION: 



a) 
b) 
c) 
d) 
e) 
f) 

g) 

h) 

i) 

j) 
k) 



AUTHOR (S) : 

TITLE: 

JOURNAL: 

VOLUME: 

ISSUE: 

PAGE(S) : 

DATE: 

DOCUMENT NUMBER: 
FILING DATE: 
PUBLICATION DATE: 

RELEVANT RESIDUES IN SEQUENCE ID NO: 



(xi) 



SEQUENCE DESCRIPTION: SEQUENCE ID NO: 1 



ATGTATTCAAGTAGCAGAAAAAGGTGCCCGAAAACCAAATGGGCTTTGAAACTTCTTACT 300 
MYSSSRKRC PKTKWALKLLT 
GCCGCATTTTTAGCAGCGAGTCCCGCGGCGAAGAGTGCTGTTAATAACGCCTATGATGCA 360 
AAFLAAS PAAKSAVNNAY DA 
TTGATTATTGAAGCTCGCAAGGGTAAT ACTCAGCCAGCTTTGTCATGGTTTGCACTAAAA 420 
LII E AR KGNTQPALSWFAL K 
TCAGCACTCAGCAATAACCAAATTGCTGACTGGTTACAGATTGCCTTATGGGCCGGGCAA 480 
SALSNNQ IADWLQ IALWAGQ 
GATAAACAGGTTATTACCGTTTACAACCGCTACCGTCATCAGCAATTACCAGCGCGTGGT 540 
D K Q V I T V Y N R Y RH QQL PAR G 
TATGCAGCTGf CGCCGTCGCTTATCGTAACCTGCAACAATGGCAAAACTCGCTTACACTG 600 
YAAV A VA YRN LQQ W QN S L T L 
TGGCAAAAGGCGCTCTCTCTGGAGCCGCAAAATAAGGATTATCAACGGGGACAAATTTTA 660 
WQKALSLE PQNKDY QRGQIL 
ACCCTGGCAGATGCTGGTCACT ATGATACTGCGCTGGTTAAACTTAAGCAGCTTAACTCT 720 
T LA DAG H Y D.TALV KL KQLN S 
GGAGCACCGGACAAAGCCAATTTACf CGCAGAAGCCTATATCTATAAACTGGCGGGGCGT 780 
GA P DBCAN LLAEA Y I Y KLAGR 
CATCAGGATGAATT ACGGGCGATGACAGAGTCATTACCTGAAAATGCATCTACGCAACAA 840 
H Q D E LRAM T E S L P ENAS T Q Q 
TATCCCACAGAATACGTGCAGGCATTACGTAATAATCAACTTGCTGCCGCGATTGACGAT 900 
YPTEY VQALRNN QLAA AIDD 
GCCAATTTAACGCCAGATATTCGCGCTGATATTCATGCCGAACTGGTCAGACTGTCGTTT 960 
ANLTPDI R ADI HAE LVRLS F 
ATGCCTACGCGCAGTGAAAGTGAACGTTATGCCATTGCCGATCGCGCCCTCGCCCAATAC 1020 
M P T RS ES ERYAI A DRALAQ Y 
GCTGCATTAGAAATTCTGTGGCACGATAACCCAGACCGCACTGCCCAGTACCAGCGTATT 1080 
A A L E I L W H D N P D R T A Q Y Q R I 
CAGGTTGATCATCTTGGCGCGTTATTAACTCGCGATCGTTATAAAGACGTTATTTCTCAC 1140 
QVDHLG ALLTRDRYKD VISH 
TATCAGCGATTAAAAAAGACGGGGCAAATTATTCCGCCCTGGGGGCAATATTGGGTTGCA 1200 
YQRLKK.T G Q I I PPWGQYWVA 
TCGGCTTATCTCAAAGATCATCAGCCGAAAAAAGCACAGTCAATAATGACCGAGCTCTTT 1260 
SAYLK DH Q PKKAQS IM TE L F 
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TATCACAAGGAGACCATTGCCCCGGATTTATCCGATGAAGAACTTGCGGATCTCTTTTAC 1320 
Y H KE T I AP DLS D EE LADL FY 
AGCCACCTGGAGAGTGAAAATTATCCGGGCGCGCT AACTGTCACCCAACAT ACCATTAAT 1380 
S HLESENY PGAL TVTQHT I N 
ACTTCGCCGCCTTTCCTTCGGTTAATGGGCACGCCTACGAGCATCCCGAATGATACCTGG 14 4 0 
TSPPFLRLMGTPTSIPNDTW 
TTACAGGGGCATTCGTTTCTCTCAACCGTAGCAAAATATAGTAATGATCTTCCTCAGGCT 1500 - 
LQGHS FLSTVAKYSNDLPQA 
GAAATGACAGCCAGAGAGCTTGCTTATAACGCACCAGGAAATCAGGGACTGCGCATTGAT 1560 
EMTARELAYNAPGNQ GLRI D 
TACGCGAGTGTGTTACAAGCCCGCGGTTGGCCTCGTGCAGCAGAAAATGAATT AAAAAAA 1620 
YASVLQARGWPRAAENELKK 
GCAGAAGTGATCGAGCCACGTAATATTAATCTGGAGGTTGAACAAGCCTGGACAGCATTA 1680 
AEVIEPRNIN LEVEQAWTAL 
ACGTTACAAGAATGGCAGCAGGCAGCTGTCTTAACGCACGATGTTGTCGAACGTGAACCG 17 40 
TLQEWQQAAVLT HDVVERE P 
CAAGATCCCGGCGTTGTACGATTAAAACGTGCGGTTGATGTACATAATCTTGCAGAGCTT 1800 
QD PGVV RLKRAVD VHNLAEL 
CGTATCGCTGGCTCAACAGGAATTGATGCCGAAGGCCCGGATAGTGGTAAACATGATGTC 18 60 
RIAGSTG IDAEGP D SGKHDV 
GACTTAACCACCATCGTTTATTCACCACCGCTGAAGGATAACTGGCGCGGTTTTGCTGGA 1920 
DLTTIVYSPPLKDNWRGFAG 
TTCGGTTATGCCGATGGACAATTTAGCGAAGGAAAAGGGATTGTTCGCGACTGGCTTGCG 1980 
FGYAbGQ FSEGKGIVRDWLA 
GGTGTTGAGTGGCGGTCACGTAATATCTGGCTCGAGGCAGAGTACGCTGAACGCGTTTTC 2040 
GV EW RS R N IWLEAE YAERVF 
AATCATGAGCATAAACCCGGCGCGCGCCTGTCTGGCTGGTATGATTTTAATGATAACTGG 2100 
NH EHKPGARLS.GWYDFNDNW 
CGTATTGGTTCGCAACTGGAACGCCTCTCTCACCGCGTTCCATTACGGGCAATGAAAAAT 2160 
R I G S Q L E R L S H RV P L RAMKN 
GGTGTTACAGGCAACAGTGCTCAGGCTTATGTTCGCTGGT ATCAAAATGAGCGGCGT AAG 2220 
G V T G N S A Q A Y V R WY Q N E R RK 
TACGGTGTCTCCTGGGCTTTCACTGATTTTTCCGACAGTAACCAGCGTCATGAAGTCTCA 2280 
YGVS WAFTDFSDSNQRHEVS 
CTTGAGGGTCAGGAACGCATCTGGTCTTCACCATATTTGATTGTCGATTTCCTACCCAGT 2340 
L E G QE R IWS S P Y L I VD FLP S 
CTGTATTACGAACAAAATACAGAACACGATACCCCATACTACAACCCTATAAAAACGTTC 2400 
LYYE QNTEH DT PY YNPIK TF 
GATATTGTTCCGGCATTTGAGGCAAGCCATTTGTT ATGGCGAAGCTATGAAAATAGCTGG 2460 
D I VPAFEAS HL LWRS YEN SW 
GAGCAAATATTCAGCGCAGGTGTTGGTGCCTCCTGGCAAAAACATTATGGCACGGATGTG 2520 
E Q I F S A G V G A S W Q K R Y G T D V 
GTCACCCAACTCGGCTACGGGCAACGCATTAGTTGGAATGACGTGATTGATGCTGGCGCA 2580 
VTQLGYGQRI SW NDVI D A G A 
ACGGTACGCTGGGAAAAACGACCTTATGACGGTGACAGAGAACACAACTtATACGTTGAA 2640 
T LRWEKRPY D GDRE HN LYVE 
TTCGATATGACATTCAGATTTTAAGGATAAATATGTTACGTAATGGAAATAAATATCTCC 2700 

F D M T F R F * 

2. INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

a) LENGTH: 2031 

b) TYPE: 

c) STRANDEDNESS: 

d) TOPOLOGY: 

(ii) MOLECULE TYPE: Combined DNA and Amino Acid Sequences 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

a) CHROMOSOME/SEGMENT : 

b) MAP POSITION: 

c) UNITS: 

(ix) FEATURE: 

a) NAME/KEY: 

b) LOCATION: 

c) IDENTIFICATION METHOD: 

d) OTHER INFORMATION: 

( X ) PUBLICATION INFORMATION: 

a) AUTHOR (S): 

b) TITLE: 

c) JOURNAL: 

d) VOLUME: 

e) ISSUE; 

f ) PAGE(S) : 

g) DATE; 

h) DOCUMENT NUMBER: 

i) FILING DATE: 

j) PUBLICATION DATE: 

k) RELEVANT RESIDUES IN SEQUENCE ID NO: 

(xi) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 2 



TTAAGGATAAATATGTTACGTAATGGAAATAAATATCTCCTGATGCTGGTGAGTATAATT 

M LRNG NKYLLMLVSII 
ATGCTCACCGCGTGCATTAGCCAGTCAAGAACATCATTTATACCGCCACAGGATCGCGAA 

MLTAC I SQSRT SFI PPQDRE 
TCTTTACTCGCCGAGCAACCGTGGCCGCATAATGGTTTTGTAGCGATTTCATGGCATAAC 
SLLAEQPWPHNGFV AI S WHN 
GTTGAAGACGAAGCTGCCGACCAGCGTTTTATGTCAGTGCGGACATCAGCACTGCGTGAA 
VED EAADQRFM SVRTSALRE 
CAATTTGCCTGGCTGCGCGAGAACGGTTATCAACCGGTCAGTATTGCTCAAATTCGTGAA 
O F A W L R E N G Y Q P V S I A Q I R E 
GCACATCGAGGAGGAAAACCGCTACCGGAAAAAGCTGTAGTGCTGACTTTTGATGACGGC 

AHRGGKP L PE KAVVLT FDDG 
TACCAGAGTTTTTATACCCGCGTCTTCCCAATTCTTCAGGCCTTCCAGTGGCCTGCTGTA 420 

Y OS FYTRV F PI LQAFQWPAV 
TGGGCCCCCGTCGGCAGTTGGGTCGATACGCCAGCGGATAAACAAGTAAAATTTGGCGAT 4 9 0 

W APVGSWV D T p A D K Q V K F G D 
GAGTTGGTCGATCGAGAATATTTTGCCACGTGGCAACAAGTGCGAGAAGTTGCGCGTTCC 
ELV D REY FA T WQQVREVARS 
CGGCTCGTTGAGCTCGCTTCTCATACATGGAATTCTCACTACGGTATTCAGGCTAATGCC 

R LVELAS H TWN S H Y G I QAN A 
ACCGGCAGCTTATTGCCTGTATATGTAAATCGTGCATATTTTACTGACCACGCACGGTAT 



60 
120 
180 
240 
300 
360 



540 
600 
660 
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T G S LLPVYVNRAYFTDHARY 

GAAACCGCAGCAGAATACCGGGAAAGAATTCGTCTGGATGCTGTAAAAATGACGGAATAC 

ETAAEYRER I RL-DAVKMTEY 

CTGCGTACAAAGGTTGAGGTAAATCCACACGTTTTTGTTTGGCCTTATGGCGAAGCGAAT 

LRTKVEVNPHVFVWPYGEAN 

GGCATAGCGATAGAGGAATTAAAAAAACTCGGTTATGACATGTTCTTCACCCTTGAATCA 

GIAIEELKKLGYDMFFTLES 

GGTTTGGCAAATGCGTCGCAATTGGATTCCATTCCGCGGGTATTAATCGCCAATAATCCC 

GLANASQLDSI PRVLIANNP 

TCATTAAAAGAGTTTGCCCAGCAAATTATTACCGTACAGGAAAAATCACCACAACGGATA 

SLKEFAQQI ITVQEKSPQRI 

ATGCATATCGATCTTGATTACGTTTATGACGAAAACCTCCAGCAAATGGATCGCAATATT 

MHI DLDYVY DENLQQMDRNI 

GATGTGCTAATTCAGCGGGTGAAAGATATGCAAATATCAACCGTGTATTTGCAGGCATTT 

D VL I Q RV K DMQ I S TV Y L QA F 

GCTGATCCCGATGGTGATGGGCTGGTCAAAGAGGTCTGGTTTCCAAATCGTTTGCTACCA 

ADP DG D GLVKEVW FPNRLLP 

ATGAAAGCAGATATTTTTAGTCGGGTTGCCTGGCAATTACGTACCCGCTCAGGTGTAAAC 

MKADI FS RVAWQLRTRSGVN 

ATCTATGCGTGGATGCCGGTATTAAGCTGGGATTTAGATCCCACATTAACGCGAGTAAAA 

I YAWM PVLS WD LDPT LT RVK 

TACTTACCAACAGGGGAGAAAAAAGCACAAATTCATCCTGAACAATATCACCGTCTCTCT 

YLPTGEKKAQ I HPE QYHRLS 

CCTTTCGATGACAGAGTCAGAGCACAAGTTGGCATGTTATATGAAGATCTTGCCGGACAT 

PFDD .RV RAQV GM LYEDLAGH 

GCTGCTTTTGATGGCATATTGTTCCACGATGATGCTTTGCTTTCAGATTATGAAGATGCC 

AA F DG I L FH DDALLS DY E DA 

AGTGCACCGGCTATCACGGCTTATCAGCAAGCAGGCTTTAGCGGGAGTCTGAGCGAAATT 

SAPAI TAYQQ AGFSGSLSEI 

CGACAAAACCCGGAGCAATTTAAACAGTGGGCCCGCTTTAAAAGTCGTGCGTTAACTGAC 

RQ N PEQFKQWARFKS RALT D 

TTCACTTTAGAACTTAGTGCGCGCGTAAAAGCCATTCGCGGTCCACATATTAAAACTGCA 

F TL E L SARV KA I RGP H I KTA 

CGAAATATTTTTGCACTTCCGGTAATACAACCTGAAAGTGAAGCCTGGTTTGCACAGAAT 

RN I FA LPV I QP E SEA W FA Q N 

TATGCTGATTTCCTAAAAAGCTATGACTGGACCGCTATTATGGCTATGCCTTATCTGGAA 

Y A D F L K S Y D W T A I MAM P Y L E 

GGTGTCGCAGAAAAATCGGCTGACCAATGGTTAATACAATTGACCAATCAAATTAAAAAC 

G VAEKS ADQWL .IQLTN QIKN 

ATCCCTCAGGCT AAAGACAAATCT AT T TT AGAATT ACAGGCACAAAACTGGCAGAAAAAT 

I P Q A K D K S ILELQAQ NWQKN 

GGTCAGCATCAGGCTATTTCTTCGCAACAACTCGCTCACTGGATGAGCCTATTACAACTG 

G Q HQ AISSQQLAHWMS LLQL 

AATGGAGTGAAAAACTATGGTTATTATCCCGACAATTTTCTGCATAACCAACCTGAAATA 

NGVKN YGYYPDNFLHNQPEI 

GACCTTATTCGTCCTGAGTTTTCAACAGCCTGGTATCCGAAAAATGATTAA 

D L I RPEFS TAWYPKND *** 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
14 40 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
- 1920 
1980 
2031 



(YCDR STOP CODON) 
(YCDQ START CODON) 



2. 



INFORMATION FOR SEQ ID NO: 3 



(i) 



SEQUENCE CHARACTERISTICS: 



a) 
b) 
c) 
d) 



LENGTH: 1560 
TYPE: 

STRANDEDNESS : 
TOPOLOGY: 



(ii) 



MOLECULE TYPE: Combined DNA and Amino Acid Sequences 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

a) CHROMOSOME/SEGMENT: 

b) MAP POSITION: 

c) UNITS: 

(ix) FEATURE: 

a) NAME /KEY: 

b) LOCATION: 

c) IDENTIFICATION METHOD: 

d) OTHER INFORMATION: 

< x ) PUBLICATION INFORMATION: 

a) AUTHOR (S) : 

b) TITLE: 

c) JOURNAL: 

d) VOLUME: 

e) ISSUE: 

f) PAGE(S): 

g) DATE: 

h) DOCUMENT NUMBER: 

i) FILING DATE: 

j) PUBLICATION DATE: 

k) RELEVANT RESIDUES IN SEQUENCE ID NO: 

< xi > SEQUENCE DESCRIPTION: SEQUENCE ID NO: 3 

AAAATGATTAATCGCATCGTATCGTTTTTTATATTATGTCTGGTGTTATGCATACCCCTA 240 

■M-INRIVSFP1LCLVLCIPL 
TGCGTAGCGTACTTTCACTCTGGTGAACTGATGATGAGGTTCGTTTTCTTCTGGCCGTTT 300 
CVAYFH SGELMMRFVFFWPF 
TTTATGTCCATTATGTGGATTGTTGGCGGCGTCTATTTCTGGGTCTATCGTGAACGCCAC 360 
F M S I MW I V G G V Y FWV Y R E R H 

TGGCCGTGGGGAGAAAACGCACCAGCTCCCCAGTTGAAAGATAATCCGTCTATCTCCATT 420 
W PWGE NAPAPQ LKDNPS I SI 
ATCATTCCCTGTTTTAATGAGGAGAAAAACGTTGAGGAAACCATACACGCCGCTTT AGCA 480 
I I P C F N E E KN V E E T I H AA LA 

CAGCGTTATGAGAACATTGAAGTTATTGCCGTAAATGACGGTTCAACAGATAAAACCCGT 5 4 0 
QRYENIE VIAVN DGST DKTR 
GCCATCCTGGATCGCATGGCTGCACAAATTCCCCATTTGCGGGTCATTCATCTGGCGCAA 600 
AILDRMAAQIPHLRVI HLAQ 
AACCAGGGGAAAGCCATTGCGCTTAAAACCGGAGCTGCCGCGGCGAAAAGTGAATATCTG 660 
N QGKA I A LKTGA AAAKS E YL 
GTGTGCATTGATGGCGATGCGTTATTAGACCGCGATGCGGCGGCATATATTGTGGAACCG 720 
V C I D G D A L L D R D A A A Y I V E P 

ATGTTGTACAACCCGCGTGTGGGTGCCGTAACCGGTAATCCTCGTATTCGAACACGTTCT 7 8 0 
M L Y N P RV .GAVT G.N P R I R T R S 
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1500 
1560 



ACCCTGGTGGGTAAAATTCAGGTTGGCGAGTATTCCTCAATTATTGGTTTGATCAAGCGA 840 

TLVGKIQVGEYS S I IGLI KR 
ACCCAGCGTATCTATGGAAACGTATTTACCGTTTCCGGTGTTATTGCCGCATTTCGTCGC 900 
TQRI YGNVFTVSGVIAAFRR 
AGCGCCCTGGCAGAAGTGGGTTACTGGAGTGACGATATGATCACCGAAGATATTGATATT 960 
SALAEVGYWSDDMITEDIDI 
AGCTGGAAGCTGCAGTTGAATCAGTGGACGATTTTTTACGAGCCACGGGCACTGTGCTGG 1020.. 
SWKLQLNQWTIFYEPRALCW 
ATATTAATGCCTGAAACGTTAAAAGGGCTGTGGAAACAGCGCCTGCGCTGGGCTCAGGGC 1080 
I LMPETLKGLWKQRLRWAQG 
GGTGCAGAAGTATTCCTCAAAAAT ATGACAAGGTTGTGGCGCAAAGAAAACTTTCGAATG 114 0 
GAEVFL KNMTRLWRKENFRM 
TGGCCGCTGTTTTTTGAATACTGCCTGACGACAATATGGGCCTTCACCTGCCTGGTCGGT 1200 
WPLFFEYCLTTIWAFTCLVG 
TTCATTATTTACGCAGTCCAACTTGCCGGTGTACCGTTAAATATTGAATTGACACATATC 1260 
FI IYAVQLAGVPLNIELTHI 
GCTGCGACACATACTGCCGGAATATTATTGTGTACGTTATGTTTACTGCAATTTATTGTC 1320 
AAT HTAG ILLCTLCLLQF I. V 
AGCCTGATGATCGAGAATCGCTATGAGCATAATCTGACTTCATGGCTTTTCTGGATTATT 1380 
SLMIENRYEHNLTSSLFWII 
TGGTTCCCGGTTATTTTCTGGATGCTGAGCCTGGCAACGACATTGGTATCATTTACACGA 1440 
W FPVI FW MLSLATTLVS FT R 
GTCATGTTGATGCCTAAAAAGCAACGCGCCCGTTGGGTAAGTCCCGATCGCGGGATTCTG 

V M h M P K KQ RA RW V S P D RG I L 
AGAGGTTAATATGAACAATTTAATTATTACGACCCGACAATCACCAGTACGTTTACTGGT 

RG* M N N L (ycdP) 

2. INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

a) LENGTH: 30 

b) TYPE: 

c) STRANDEDNESS : 

d) TOPOLOGY: 

(il) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 
( V ) FRAGMENT TYPE: 
( V i) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

a) CHROMOSOME /SEGMENT: 

b) MAP POSITION: 

c) UNITS: 

(ix) FEATURE i 

a) NAME /KEY: 

b) LOCATION: 

c) IDENTIFICATION METHOD: 



d) OTHER INFORMATION: 

( X ) PUBLICATION INFORMATION: 

a) AUTHOR(S): 

b) TITLE: 

c) JOURNAL: 

d) VOLUME: 

e) ISSUE: 

f) PAGE(S): 

g) DATE: 

h) DOCUMENT NUMBER: 

i) FILING DATE: 

j) PUBLICATION DATE: 

k) RELEVANT RESIDUES IN SEQUENCE ID NO 

( X i) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 4 

TACAGTTAAG TGTGTTATCG GTGCAGAGCC 
2. INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

a) LENGTH: 31 

b) TYPE: 

c) STRANDEDNESS : 

d) TOPOLOGY: 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 
( V ) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

a) CHROMOSOME/ SEGMENT: 

b) MAP POSITION: 

c) UNITS: 

(ix) FEATURE: 

a) NAME /KEY: 

b) LOCATION: 

c) IDENTIFICATION METHOD: 

d) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

a) AUTHOR(S) : 

b) TITLE: 

c) JOURNAL: 

d) VOLUME: 
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e) ISSUE: 

f) PAGE(S): 

g) DATE: 

h) DOCUMENT NUMBER: 

i) FILING DATE: 

j) PUBLICATION DATE: 

k) RELEVANT RESIDUES IN SEQUENCE ID NO: 



(xi) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 5 

CTCAACGCCT GGCTGATTAA ACCAACTATT C 31 
2. INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 



a) LENGTH: 7500 

b) TYPE: 

c) STRANDEDNESS : 

d) TOPOLOGY: 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

a ) CHROMOSOME/ S EGMENT : 

b) MAP POSITION: 

c) UNITS: 

(ix) FEATURE: 

a) NAME /KEY; 

b) LOCATION: 

c) IDENTIFICATION METHOD: 

d) OTHER INFORMATION : 

(x) PUBLICATION INFORMATION: 



a) 


AUTHOR (S) : 


b) 


TITLE: 


c) 


JOURNAL: 


d) 


VOLUME: 


e) 


ISSUE: 


f) 


PAGE(S) : 


g) 


DATE: 


h) 


DOCUMENT NUMBER: 


i) 


FILING DATE: 


j) 


PUBLICATION DATE: 


k) 


RELEVANT RESIDUES IN SEQUENCE ID NO: 
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( X i) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 6 

ATGTATTCAAGTAGCAGAAAAAGGTGCCCGAAAACCAAATGGGCTTTGAAACTTCTTACT 

GCCGCATTTTTAGCAGCGAGTCCCGCGGCGAAGAGTGCTGTTAATAACGCCTATGATGCA 

TTGATTATTGAAGCTCGCAAGGGTAATACTCAGCCAGCTTTGTCATGGTTTGCACTAAAA 

TCAGCACTCAGCAATAACCAAATTGCTGACTGGTTACAGATTGCCTTATGGGCCGGGCAA 

GATAAACAGGTTATTACCGTTTACAACCGCTACCGTCATCAGCAATTACCAGCGCGTGGT 

TATGCAGCTGTCGCCGTCGCTTATCGTAACCTGCAACAATGGCAAAACTCGCTTACACTG 

389 

TGGCAAAAGGCGCTCTCTCTGGAGCCGCAAAATAAGGATTATCAACGGGGACAAATTTTA 

ACCCTGGCAGATGCTGGTCACTATGATACTGCGCTGGTTAAACTTAAGCAGCTTAACTCT 

GGAGCACCGGACAAAGCCAATTTACTCGCAGAAGCCTATATCTATAAACTGGCGGGGCGT 

583 

CAT CAGGAT G AATT ACGGGCG ATG ACAGAG TC AT TACCT G AAAATGCATCT ACGCAACAA 



300 



600 



1200 



TATCCCACAGAATACGTGCAGGCATTACGTAATAATCAACTTGCTGCCGCGATTGACGAT 

GCCAATTTAACGCCAGATATTCGCGCTGATATTCATGCCGAACTGGTCAGACTGTCGTTT 

ATGCCTACGCGCAGTGAAAGTGAACGTTATGCCATTGCCGATCGCGCCCTCGCCCAATAC 

GCTGCATTAGAAATTCTGTGGCACGATAACCCAGACCGCACTGCCCAGTACCAGCGTATT 

CAGGTTGATCATCTTGGCGCGTTATTAACTCGCGATCGTTATAAAGACGTTATTTCTCAC 900 

TATCAGCGATTAAAAAAGACGGGGCAAATTATTCCGCCCTGGGGGCAATATTGGGTTGCA 

tcggcttatctcaaagatcatcagccgaaaaaagcacagtcaataatgaccgagctcttt 

TATCACAAGGAGACCATTGCCCCGGATTTATCCGATGAAGAACTTGCGGATCTCTTTTAC 

agccacctggagagtgaaaattatccgggcgcgctaactgtcacccaacataccattaat 

ACTTCGCCGCCTTTCCTTCGGTTAATGGGCACGCCTACGAGCATCCCGAATGATACCTGG 
TTACAGGGGCATTCGTTTCTCTCAACCGTAGCAAAATATAGTAATGATCTTCCTCAGGCT 
GAAATGACAGCCAGAGAGCTTGCTTATAACGCACCAGGAAATCAGGGACTGCGCATTGAT 
TACGCGAGTGTGTTACAAGCCCGCGGTTGGCCTCGTGCAGCAGAAAATGAATTAAAAAAA 
GCAGAAGTGATCGAGCCACGTAATATTAATCTGGAGGTTGAACAAGCCTGGACAGCATTA 
ACGTTACAAGAATGGCAGCAGGCAGCTGTCTTAACGCACGATGTTGTCGAACGTGAACCG 
CAAGATCCCGGCGTTGTACGATTAAAACGTGCGGTTGATGTACATAATCTTGCAGAGCTT 
CGTATCGCTGGCTCAACAGGAATTGATGCCGAAGGCCCGGATAGTGGTAAACATGATGTC 
GACTTAACCACCATCGTTTATTCACCACCGCTGAAGGATAACTGGCGCGGTTTTGCTGGA 
TTCGGTTATGCCGATGGACAATTTAGCGAAGGAAAAGGGATTGTTCGCGACTGGCTTGCG 



1500 
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GGTGTTGAGTGGCGGTCACGTAATATCTGGCTCGAGGCAGAGTACGCTGAACGCGTTTTC 1800 

AATCATGAGCATAAACCCGGCGCGCGCCTGTCTGGCTGGTATGATTTTAATGATAACTGG 

CGTATTGGTTCGCAACTGGAACGCCTCTCTCACCGCGTTCCATTACGGGCAATGAAAAAT 

GGTGTTACAGGCAACAGTGCTCAGGCTTATGTTCGCTGGTATCAAAATGAGCGGCGTAAG 

TACGGTGTCTCCTGGGCTTTCACTGATTTTTCCGACAGTAACCAGCGTCATGAAGTCTCA 

CTTGAGGGTCAGGAACGCATCTGGTCTTCACCATATTTGATTGTCGATTTCCTACCCAGT 2100 

CTGTATTACGAACAAAATACAGAACACGATACCCCATACTACAACCCTATAAAAACGTTC 

GATATTGTTCCGGCATTTGAGGCAAGCCATTTGTTATGGCGAAGCTATGAAAATAGCTGG 

GAGCAAATATTCAGCGCAGGTGTTGGTGCCTCCTGGCAAAAACATTATGGCACGGATGTC 

GTCACCCAACTCGGCTACGGGCAACGCATTAGTTGGAATGACGTGATTGATGCTGGCGCA 

ACGCT ACGCTGGGAAAAACG ACCTT ATGACGGTGACAGAGAACACAACTTATACGTTGAA 2400 

ycdR(+l> 

TTCGAT ATGAC AT TCAG AT T T TAAGGAT AAAT ATG TTACG TAATGG AAAT AAATATC TCC 

TGATGCTGGTGAGTATAATTATGCTCACCGCGTGCATTAGCCAGTCAAGAACATCATTTA 

TACCGCCACAGGATCGCGAATCTTTACTCGCCGAGCAACCGTGGCCGCATAATGGTTTTG 

TAGCGATTTCATGGCATAACGTTGAAGACGAAGCTGCCGACCAGCGTTTTATGTCAGTGC 

GGACATCAGCACTGCGTGAACAATTTGCCTGGCTGCGCGAGAACGGTfATCAACCGGTCA 2700 

GTATTGCTCAAATTCGTGAAGCACATCGAGGAGGAAAACCGCTACCGGAAAAAGCTGTAG 

TGCTGACTTTTGATGACGGCTACCAGAGTTTTTATACCCGCGTCTTCCCAATTCTTCAGG 

CCTTCCAGTGGCCTGCTGTATGGGCCCCCGTCGGCAGTTGGGTCGATACGCCAGCGGATA 

AACAAGTAAAATTTGGCGATGAGTTGGTCGATCGAGAATATTTTGCCACGTGGCAACAAG 

TGCGAGAAGTTGCGCGTTCCCGGCTCGTTGAGCTGGCTTCTCATACATGGAATTCTCACT 3000 

ACGGTAT TCAGGCTAATGCCACCGGCAGCTTAT TGCCTGTATATG T AAATCGTGCATATT 

TTACTGACCACGCACGGTATGAAACCGCAGCAGAATACCGGGAAAGAATTCGTCTGGATG 

723 

CTGTAAAAATGACGGAATACCTGCGTACAAAGGTTGAGGTAAATCCACACGTTTTTGTTT 

GGCCTTATGGCGAAGCGAATGGCATAGCGATAGAGGAATTAAAAAAACTCGGTTATGACA 

TGTTCTTCACCCTTGAATCAGGTTTGGCAAATGCGTCGCAATTGGATTCCATTCCGCGGG 3300 

TATTAATCGCCAATAATCCCTCATTAAAAGAGTTTGCCCAGCAAATTATTACCGTACAGG 

AAAAATCACCACAACGGATAATGCATATCGATCTTGATTACGTTTATGACGAAAACCTCC 

AGCAAATGGATCGCAATATTGATGTGCTAATTCAGCGGGTGAAAGATATGCAAATATCAA 

CCGTGTATTTGCAGGCATTTGCTGATCCCGATGGTGATGGGCTGGTCAAAGAGGTCTGGT 
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TTCCAAATCGTTTGCTACCAATGAAAGCAGATATTTTTAGTCGGGTTGCCTGGCAATTAC 3600 

GTACCCGCTCAGGTGTAAACATCTATGCGTGGATGCCGGTATTAAGCTGGGATTTAGATC 

CCACATTAACGCGAGTAAAATACTTACCAACAGGGGAGAAAAAAGCACAAATTCATCCTG 

AACAATATCACCGTCTCTCTCCTTTCGATGACAGAGTCAGAGCACAAGTTGGCATGTTAT 

ATGAAGATCTTGCCGGACATGCTGCTTTTGATGGCATATTGTTCCACGATGATGCTTTGC 

TTTCAGATTATGAAGATGCCAGTGCACCGGCTATCACGGCTTATCAGCAAGCAGGCTTTA 3900 

GCGGGAGTCTGAGCGAAATTCGACAAAACCCGGAGCAATTTAAACAGTGGGCCCGCTTTA 

AAAGTCGTGCGTTAACTGACTTCACTTTAGAACTTAGTGCGCGCGTAAAAGCCATTCGCG 

GTCCACATATTAAAACTGCACGAAATATTTTTGCACTTCCGGTAATACAACCTGAAAGTG 

AAGCCTGGTTTGCACAGAATTATGCTGATTTCCTAAAAAGCTATGACTGGACCGCTATTA 

TGGCTATGCCTTATCTGGAAGGTGTCGCAGAAAAATCGGCTGACCAATGGTTAATACAAT 4200 

TGACCAATCAAATTAAAAACATCCCTCAGGCTAAAGACAAATCTATTTTAGAATTACAGG 

CACAAAACTGGCAGAAAAATGGTCAGCATCAGGCTATTTCTTCGCAACAACTCGCTCACT 

GGATGAGCCTATTACAACTGAATGGAGTGAAAAACTATGGTTATTATCCCGACAATTTTC 

TGCATAACCAACCTGAAATAGACCTTATTCGTCCTGAGTTTTCAACAGCCTGGTATCCGA 
ycdQ(+l) 

AAAATGATTAATCGCATCGTATCGTTTTTTATATTATGTCTGGTGTTATGCATACCCCTA 4 500 

TGCGTAGCGTACTTTCACTCTGGTGAACTGATGATGAGGTTCGTTTTCTTCTGGCCGTTT 

TTTATGTCCATTATGTGGATTGTTGGCGGCGTCTATTTCTGGGTCTATCGTGAACGCCAC 

TGGCCGTGGGGAGAAAACGCACCAGCTCCCCAGTTGAAAGATAATCCGTCTATCTCCATT 

ATCATTCCCTGTTTTAATGAGGAGAAAAACGTTGAGGAAACCATACACGCCGCTTTAGCA 

CAGCGT T ATGAGAACATTGAAGT TAT T GCCGTAAATGACGGTTCAACAGATAAAACCCGT 4800 

GCCATCCTGGATCGCATGGCTGCACAAATTCCCCATTTGCGGGTCATTCATCTGGCGCAA 

AACCAGGGGAAAGCCATTGCGCTTAAAACCGGAGCTGCCGCGGCGAAAAGTGAATATCTG 

GTGTGCATTGATGGCGATGCGTTATTAGACCGCGATGCGGCGGCATATATTGTGGAACCG 

ATGTTGTACAACCCGCGTGTGGGTGCCGTAACCGGTAATCCTCGTATTCGAACACGTTCT 

ACCCTGGTGGGTAAAATTCAGGTTGGCGAGTATTCCTCAATTATTGGTTTGATCAAGCGA 5100 

ACCCAGCGTATCTATGGAAACGTATTTACCGTTTCCGGTGTTATTGCCGCATTTCGTCGC 

AGCGCCCTGGCAGAAGTGGGTTACTGGAGTGACGATATGATCACCGAAGATATTGATATT 

AGCTGGAAGCTGCAGTTGAATCAGTGGACGATTTTTTACGAGCCACGGGCACTGTGCTGG 

ATATTAATGCCTGAAACGTTAAAAGGGCTGTGGAAACAGCGCCTGCGCTGGGCTCAGGGC 
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GGTGCAGAAGTATTCCTCAAAAATATGACAAGGTTGTGGCGCAAAGAAAACTTTCGAATG 54 00 

TGGCCGCTGTTTTTTGAATACTGCCTGACGACAATATGGGCCTTCACCTGCCTGGTCGGT 

TTCATTATTTACGCAGTCCAACTTGCCGGTGTACCGTTAAATATTGAATTGACACATATC 

GCTGCGACACATACTGCCGGAATATTATTGTGTACGTTATGTTTACTGCAATTTATTGTC 

AGCCTGATGATCGAGAATCGCTATGAGCATAATCTGACTTCATCGCTTTTCTGGATTATT 

TGGTTCCCGGTTATTTTCTGGATGCTGAGCCTGGCAACGACATTGGTATCATTTACACGA 5700 

GTCATGTTGATGCCTAAAAAGCAACGCGCCCGTTGGGTAAGTCCCGATCGCGGGATTCTG 
ycdP(+l) 

AGAGGTTAATATGAACAATTTAATTATTACGACCCGACAATCACCAGTACGTTTACTGGT 

TGATTATGTTGCCACAACCATCTTGTGGACATTATTTGCGTTGTTCATATTCTTATTCGC 

CATGGATCTGCTGACGGGTTATTACTGGCAAAGCGAGGCCAGAAGCCGACTTCAGTTCTA 

TTTTTTGCTGGCAGTGGCGAATGCCGTCGTGTTAATTGTCTGGGCGCTGTACAATAAGCT 6000 

GCGTTTTCAAAAACAGCAGCATCATGCAGCCTACCAATATACGCCGCAAGAATATGCAGA 

GAGCTTAGCAATACCTGATGAGCTCTATCAGCAACTACAAAAAAGCCACAGGATGAGCGT 

ACACTTCACCAGCCAGGGGCAAATAAAAATGGTTGTTTCAGAAAAAGCGCTAGTCCGGGC 

ATAAACACCCAAAACAAAGCCCGGTTCGCCCGGGCTCTGCACCGATAACACACTTAACTG 

TAGGCATGCAGCGTACGTTGGCAAAGTGCCGAACGTACGCAGTCCTCTTTACCGAACCGG 6300 

ACGATCCCAACCATTTCATCTTCTTCGAAACGTTCCAGCGCGTCACTTAATCCGGAGCAC 

ACGCCGCGAGGCAAATCGCATTGCGTGATATCACCGTTGACGATAACCGTCACGTTCTCG 

CCGAGGCGGGTTAAAAACATTTTCATTTGCGCGGCAGTCAGATTCTGCGCCTCGTCAAGA 

ATGACGACTGCATTTTCAAAGGTACGTCCACGCATATAGGCGAACGGCGCAATTTCCACC 

TTCCCTATTTCCGGTCGCAGGCAGTACTGCATAAAGGAAGCCCCTAAGCGCCGGACCAGC 6600 

ACGTCGTAGACCGGGCGAAAATAGGGAGCAAACT T TTCTGCGAT ATCTCCAGGTAAGAAG 

CCAAGATCTTCATCGGCTTGCAGAACTGGACGGGTGACGATAATCCTGTCGACATCCTTA 

TGTATCAGGGCCTCTGCCGCTTTTGCTGCGCTGATCCAGGTTTTTCCGCACCCGGCTTCG 

CCCGTGGCGAATATCAGCTGCTTACTCTCAATAGCCTTCAGATAGTGCAATTGCGCTTCA 

TTTCGCGCGAGGATGGGCGAAGTATCGCGACTGTCGCGGGCCATACCAATGGCTTCTACG 69.00 

CCGCCCATCTGCACAAGCGAGGTGACCGATTCTTCTTCACGCTGCTTATGGCTGCGCGAA 

TCCCGTCTCAGCACACGTTTTGCCTCGCGACGAGCTTTGATCACTGCTTTTTGTCTTCCC 

ATGGAGAGCACCTTGAGTTGTTTGTATTCATCACACGCGCCGTTGGCAGCGCGATTATGC 

GCACGAACATCAGAGGGTTGGCTTCCTTGTAAGCCATAGTTTGCTTTTGGATAAAATGCC 
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GAAAAACGGCTACGCGCACCGTTTACGGCGTCGGTAACACATGAAAAGAAAGGATGAGGT 7200 

TGAAAATGCAAAGTGACGAGATGACTACCGGAGGAGAAAACTCCGCGAGTGGTGGCGCGT 

TGATTATCTAAAACATGTCCAGTACAGGACGTTACCATCCGCGATCTCCATAGTGACTGA 

CTATCACTGCCGGGAACTTCCGCTGCTACTTAATAAGTACAACAGATCTCGCATTTATTG 

CAACAATATATTTACTTATATTTAACTATAAAACACCATTTCAGTGACATTAGTTTCTAC 

TGGAAAGATGACAGAGTGATGACAGTGATGAAAAAAGCTGTGTGCTTTCAGCAGGATTTG 7500 
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